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A METHOD OF CALCULATING INTERNAL SIGNALS FOR USE IN A MAP 

ALGORITHM 

FIELD OF THE INVENTION 

This invention relates to the method for calculating internal signals used in a MAP algorithm, 
more particularly, but not exclusively, for use in the max-log-MAP (maximum a posteriori) 
algorithm. 

BACKGROUND OF THE INVENTION 

The max-log-MAP decoding algorithm is a simplified version of the MAP decoding 
algorithm, which is also known as the BCJR decoding algorithm. The max-log-MAP is used to 
decode a convolutional encoder. It is one of the more popular soft output decoding methods. 

A main application of the max-log-MAP decoder is in the decoding of turbo codes, which are 
generated using two parallel concatenated convolutional encoders. Two max-log-MAP 
decoders are used as component decoders in the turbo decoder. Today, turbo codes have 
become established as an important method of channel coding in the third generation wireless 
mobile commxmication technical specification. However, as time progresses and demand for 
high data rate communication increases, the turbo decoder is burdened with the task of 
performing with very high processing speed. Improving the speed of the component decoders 
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of the turbo decoder, i.e. the max-log-MAP decoders, is the key to increasing turbo decoding 
speed- 

The conventional max-log-MAP algorithm will now be described. 

At the input of the max-log-MAP decoder are two kinds of received signals: received 
systematic symbols X and received coded symbols Y. A third possible kind of input signal is a 
lambda X signal from the previous max-log-MAP decoder output if iterative decoding (turbo 
decoding) is applied. These signals are used to compute lambda, or in the case of turbo 
decoding, new lambda values. Several important internal signals of the max-log-MAP are 
defined: a, p, and Y. 

a and p are computed recursively - i.e. the computation of the internal signal a or p at a 
symbol sequence t needs the value of the intemal signal from the previous, for a, or following, 
for p, symbol sequence (t-1 or t+1). 

There are two types of T values: is associated with the probability of a systematic symbol 
X being bit 0, while Y^ is associated with the probability of the systematic symbol being bit 1. 
Y is computed using X, Y and the lambda from the previous max-log-MAP decoder output, 
denoted by >L.p. 

Signal Y is calculated according to a trellis diagram. Table 1 gives and example of how YO and 
Yl, given states m, are calculated in an 8-state recursive systematic convolutional (RSC) 
encoder with the following transfer function: [1, gi(D)/go(D)] where go(D)=H-D^-fD^ and 
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gl=l+D+D^ In the above polynomials, the Ds describe a set of different Hamming distances 
(with respect to all zeros) that can be obtained from a convolutional encoder. Note that, 1 
actually referes to D° (Hamming distance of 0). Here, an 8-state RSC encoder is taken as an 
example. In fact, m can be other states, such as 16 states or 32 states, depending on the 
application. 
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Table 1 : Example of Y values for states m 



The a for symbol sequence t and states m (total eight different states) is calculated using a and 
Y at t-1 from two different states (m° and m^). This is done by first computing an 
lumormalized signal a 

at(m) = max-of-2 { at-i(m°) + Y°t.i(m<') , at.i(m') + YS.i(m^) } (1) 

Using the example of the same RSC encoder used in Table 1, Table 2 gives the values of m° 
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and m corresponding to m, wherein states m° and are selected arbitrarily from eight 
different states m. 

The initial values ao(m) are set to appropriate constants. Usually, ao(0) is set to 0 and {ao(l), 
ao(2),..., ao(7)} are set to a large negative number, for example, -128. 
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Table 2: Example of m° and values corresponding to m for a computation 
After all eight states are computed, all a values are normalized by subtracting a constant A*. 
at(m) = at(m)- At (2) 
At is a function of Ot which can be obtained in several ways. Some of them are 

At = max-of-8 { at(m) }ms{o.i 7} (3a) 

At = ( max-of-8 { at(m) }„e(o,i 7} + min-of-8 { at(m) }„e{o.i 7} ) / 2 (3b) 
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At = at(0) (3c) 

Similarly, p is computed using infonnation from different states (m° and in^) and the symbol 
sequence t+1, and using Y from states m at sequence t. 

First, the unnormalized 0 is computed 

fit(m) = max-of-2 { ptf i(mV TVm), pti-i(m V Y^m) } (4) 

Table 3 gives the values of m" and corresponding to m, using the same RSC encoder. 

The initial value gL+i(0), where L is the frame size, is usually set to 0, while {fiL+i(l), 
fiL+i(2),...,fiL+i(7)} are usually set to a large negative number, for example, -128. 
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Table 3: Example of and values corresponding to m for p computation 
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After all 8 states are computed, all g values are normalized by subtracting a constant Bt which 
is obtained in the shnilar way as At 

pt(m) = fit(in)-Bt (5) 

Lambda is computed using a, p and T of all states. The m° and m^ states for p are the same as 
given in Table 3. 

Xt= max-of-8 { at(m) + YVm) + pt+i(m^) } m6{o,i....,7} 

- max-of-8 { at(m) + YVm) + pt+i(m°) } m6{0.i.....7} (6) 

As can be seen from the equations, two or more symbols of a or p cannot be computed in 
parallel, and therefore must be computed in sequence. However, the sequential computation of 
a can run in parallel with the sequential computation of p. Therefore, as a and .p have exactly 
the same complexity and are independent of each other, from this point, the a and p 
computations are referred as a single entity called a/p computations, for convenience sake. 

Lambda computations, however, are not independent of a/p computations. The lambda 
computation for symbol sequence t cannot begin until a and p of symbol sequence t are 
computed. 

Looking at the algorithm for computing a/p for symbol sequence t (equations (1) - (5)), it can 
be seen that the algorithm can be divided into four dependent sequential stages. In other 
words, one stage caimot begin imtil the previous has been completed. For example, to compute 
a for symbol sequence t, the four stages Sl(t) - S4(t) are 
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Sl(t): Compute YVi(m°) and Y\.i(m^) using Table 1 
S2(t): Compute at(m) using equation (1) 
S3(t): Compute At using one of the equations (3a), (3b), (3c) 
S4(t): Compute <Xt(m) using equation (2) 

It is possible to implement pipelining by starting S2 for the next symbol sequence t+1 after 
S4(t). This is because Sl(t) is independent of any stage at any symbol sequence. Figure 1 
iUustrates an example of the timeline of such an implementation, with the assumption that the 
length of time taken for the stages SI - S4 follows the proportion of T1:T2:T3:T4 = 
1:1:1.5:0.5. 

In general, the length of time taken to complete the stages given L number of symbols is 
T = T1+L(T2 + T3+T4) (7) 

Using the prior art, the pipelining implementation is limited by the normalization computation 
(S3 and S4). In other words, part of the computation (referring to S2) of symbol sequence t+1 
can begin only after the normalization computation in symbol sequence t is completed. 

One way to improve this is to forgo the normalization computation for a few symbols 
(normalization can never be completely eliminated because it is needed to prevent overflow), 
but this would produce different results and slightly increase the number of bits which are set 
aside for the internal signals. 
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It is an object of the invention to provide a method of calculating decoding signals that can 
improve the pipelining implementation. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided a method of calciilating internal 
signals for use in a MAP algorithm, comprising the steps of: obtaining first decoding signals 
by processing received systematic and received encoded symbols of each symbol sequence of 
a received signal; obtaining unnormalized second decoding signals for the current symbol 
sequence by processing the first decoding signals of the previous sequence and second 
decoding signals of the previous sequence; obtaining imnormalized third decoding signals for 
the current symbol sequence by processing the first decoding signals of the current sequence 
and third decoding signals of the next sequence; normalizing the xmnormalized second and 
third decoding signals; and wherein at least one of said second decoding signals of the 
previous sequence and said third decoding signals of the next sequence are unnoirrialised. 

Preferably, said first decoding signals are of two types, one type being associated with the 
probability of a said systematic symbol being 0 and the other type being associated with the 
probability of a said systematic symbol being 1 . 

Preferably, the step of obtaining current xinnormalized second decoding signals is 
implemented by: 

at(m) = max-of-2 { at-i(m^) + YVi(m^ , fit-iCm^) h- Y\.i(m^) } - Afi 
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where at(in) are said current xinnormalized second decoding signals for states m, at.i(m^) and 
i(m^) are respectively said prior unnormalized second decoding signals, for states and m\ 
Y^-i(m°) and Y\.i(m^) are respectively said two types of said prior first decoding signals for 
states in° and m\ Am is a second decoding signal constant for a previous time period, wherein 
said states and are selected from-said states m. 

Preferably, the step of normalizing said second decoding signal comprises the step of 

calculating: 

at(m) = at(m) - At 

where, at(m) are said current second decoding signals for states m, at(m) are said 
unnormalized second decoding signals for states m. At is a second decoding signal constant for 
the current period. 

Preferably, the step of obtaining current unnormalized second decoding signals is 
implemented by: 

at(m) = max-of-2 { at.i(m^) + Y^-i(m^) , fit-iCm^ + Y\.i(m^) } 

where afyn) are said current uimormalized second decoding signals for states m, at-i(m^) and at. 
i(m^) are respectively said prior unnormalized second decoding signals, for states m^ and m^ 
YVi(m*^) and Y\,i(m^) are respectively said two types of said prior first decoding signals for 
states m^ and m^ wherein said states m^ and m^ are selected from said states m. 

Preferably, said step of normalizing said current second decoding signal comprises the step of 
calculating: 
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tttCm) = at(m) - At - (Ai + A2 + ... + Am) 

where, at(m) is said current second decoding signals for states m, at(m) is said unnormalized 
second decoding signals for states m, and Ai, A2, At-i and At are respectively second 
decoding signal constants for the first to current periods. 

Preferably, said second decoding signal constant for the current period At is one of: 
At = max-of-all states { a^m) } 

At = ( max-of-all states { at(m) } + min-of-all states { at(m) }) / 2 
At = at(0) 

where at(m) is said unnormalized second decoding signals for states m. 

The method of the present invention preferably further comprises a step of setting the initial 
values of said xmnormalized second decoding signals to selected constants. 

Preferably, the step of obtaining said imnormalized third decoding signals is implemented by: 
gt(m) = max-of-2 { fit+i(mV Y^m), gt+i(m')+ Y^(m) } - Bm 

where fit(ni) are the current imnormalized third decoding signals for states m, gt+i(ni^)) and 
gt+i(m^) are respectively two values of the future unnorniEilized third decoding signals for 
states m^ and states m\ Y^t(m) and Y\(m) are respectively said two types of said ciurent first 
decoding signals for states m, Bt-i is a third decoding signal constant for a prior period and 
said states m^ and m^ are selected from said states m. 

Preferably, said step of normalizmg said third decoding signals is implemented by calculating 
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Pt(m) = fit(m)-Bt 

where, Pt(m) are the current third decoding signals for states m, gt(ni) are the current 
unnormalized third decoding signals for states m, and Bt is the current third decoding signal 
constant 

Preferably, the step of obtaming said unnormalized third decoding signals is implemented by: 
fit(m) = max-of-2 { fit+i(mV Y^m), fit+i(m')+ Y^(m) } 

where gt(ni) are the current unnormalized third decoding signals for states m, fit+i(m^) and 
gt+i(ni^) are respectively two values of the future unnonnalized third decoding signal for 
states mO and ml, Y^(m) and Y\(m) are respectively said two types of the current first 
decoding signals, wherein said states m^ and m^ are selected firom said states m. 

Said step of normalizing said third decoding signal ntiay comprise the step of calculating 
pt(m) = fit(m) - Bt - (Bi + B2 + ... + Bt-i) 

where, pt(m) are the current third decoding signal for states m, gt(m) are xmnormalized third 
decoding signal for states m, and Bi, B2, Bm and Bt are respectively third decoding signal 
constants for the first to current periods; 

The third decoding signal constant Bt for the current period may be : 
Bt = max-of-all states { fit(m) } 

Bt = ( max-of-all states { fit(m) } + mm-of-all states { fit(m) }) / 2 
Bt = fit(0) 
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Where gt(ni) are the current unnormalized third decoding signals for states m. 

The method of the present invention preferably further comprises a step of setting the initial 
values of said xaimormalized third decoding signals to selected constants. 

Preferably, both of said second decoding signals of the previous sequence and said third 
decoding signals of the next sequence are uimormalised. 

Preferably, the calculation of the intemal signals is pipelined whereby the calculation for the 
next symbol sequence is commenced once the unnormalized signals for the current symbol 
sequence have been calcxilated. 

The described embodiment of the invention is a modification of the calculations of decoding 
signals a and/or p in algorithm to allov^ for an improved pipelining implementation and 
allows S2(t+1) to be independent of the normalization in t, and can begin computation as soon 
as S2(t) is completed. The described embodiment also retains the same final values of the 
intemal signals (the values after S4) as the prior art. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Embodiments of the invention will now be described, by way of example, with reference to 
the accompanying drawings, in which: 
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Fig. 1 illustrates a prior art timing diagram of the four stages of a/p computation for three 
symbols in a pipelining implementation. 



Fig. 2 illustrates a timing diagram of the four stages of a/p computation for three symbols in a 
pipelining implementation according to a first embodiment of the invention. 

Fig. 3 illustrates a timing diagram of thie four stages of ct/p computation for three symbols in a 
pipelining implementation according to a second embodiment of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 

The embodiments of the invention use modified max-log-MAP algorithms and a first 
embodiment will now be described. 

In the modified algorithm, the computations for a/p are modified so that pipelining 
implementation can be faster than that of the prior art. The main feature is to make the 
computation of s/fi of symbol sequence t+1 independent of the normalization of a/g of symbol 
sequence t 

Equation (1) is modified so that at(m) is computed using unnormalized values at.i(m) instead 

of normalized values at.i(m), and a normalized constant Am is added. 

at(m) = max-of-2 { at.i(m°) + Y°t.i(m°) , at-i(m^) + Y\.i(m^) } - A t-i (8) 

Equation (2) remains the same. 
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at(m) = at(m) — At 



(9) 



Similarly, equations (4) and (5) are modified to 



gt(m) = max-of-2 { fiH.i(m°)+ Y°t(m), fiH-i(m')+ Y^,(m) } - B, 



't-i 



(10) 



pt(m) = gt(m)-Bt 



(11) 



The final values of Ot and Pt are still exactly the same as the prior art. 

Following these modifications, the four dependent sequential stages described in the prior art 
are now slightly different. For example, to compute a for symbol sequence t, the four stages of 
the modified equations S lm(t) — S4m(t) are 

Slm(t): Compute TVi(m^ and Y\.i(m^) using Table 1 

S2m(t): Compute at(m) using equation (8) (Am being known firom the previous iteration) 
S3m(t): Compute At using one of the equations (3a), (3b), (3c) 
S4m(t): Compute at(m) using equation (9) 

Compared with the prior art, the length of time taken for these modified stages is almost the 
same. SI and Sim take exactly the same computation time, and so do the pair S3 and S3m and 
the pair S4 and S4m. The length of time taken for stage S2m is slightly increased compared 
with S2, because S2m fiirther subtracts the constant Am- 



As in the prior art, it is possible to implement pipelining by starting S2 for the next symbol 
sequence t+1 after S4(t). This is because Sl(t) is independent of any stage at any symbol 
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sequence. Figiare 1 illustrates an example of the timeline of such an implementation, with the 
assumption that the length of time taken for the stages S 1 - S4 follows the ratio 1:1: 1 .5:0.5. 

However, tfie pipelining implementation can now be improved because S2m(t+1) can begin 
after S2m(t). This is because the normalization computations have been deferred. Figure 2 
illustrates an example of the timeline of the improved pipelining implementation according to 
the first embodiment of the present invention, with the assumption that the length of time 
taken for the stages Sim - S4m follows the proportion of Tlm:T2m:T3m:T4m = 1 : 1 .5 : 1 .5:0.5, 
which is the same as the assumption given in the prior art description except for the second 
stage. 

Stage 2 can start immediately after the end of the previous stage 2. Therefore, in general, the 
length of time taken to complete the stages given L number of symbols is 

Tm = Tlm + T2m-h(T3m + T4m) + (L-l)x(T2m4-T3m + T4m-T3m-T4m) 

= Tlm + LxT2m + T3m + T4m (12) 

By comparing with the prior art, we can substitute Tl = Tim, T2 = T2m/1.5, T3 = T3m, and 
T4 = T4m, and obtain 

T = Tim + L (T2m/1.5 + T3m + T4m) (13) 



The number of times of improvement in speed compared with the prior art is 



T/Tm = 
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( T 1 m + L (T2in/ 1 . 5 + T3m + T4m) ) / (T 1 m + L x T2m + T3m + T4m) 



(14) 



For there to be an improvement in speed, the ratio T / Tm must be more than one. Further 
simplification of T / Tm > 1 yields 

L>H-T2m/(3xT3m + 3xT4m-T2m) (15) 

It can be seen from equation (15) that there will be an improvement in speed as long as the 
frame size is not one, as T2m / (3 x T3m + 3 x T4m - T2m) is likely to be less than one. 

f 

Practically, this means that there will be an improvement at any frame size. 

Using the example of Tlm:T2m:T3m:T4m = 1:1.5:1.5:0.5 in equation (14), the number of 
times of improvement in speed approaches 2 for large L. 

However, the improvement should be smaller in the case where S3 is computed using a more 
complex method such as in equations (3a) and (3b), because a relatively small T3 produces a 
small T / Tm ratio. 

In a second embodiment, the algorithm described as the first embodiment can be slightly 
modified and still retain the properties of independent normalization and same fined values as 
now described. 

Equation (8) and (9) can be rearranged as 



at(m) = max-of-2 { at.i(m*^ + YVi(m^ , at.i(m^) + T^.l(m^) } 



(16) 
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and 

at(m) = at(m) - At - (Ai + A2 + ... + At-i) (17) 

This shortens the procedure in (8) and makes the procedure in (9) longer, but the effect is the 
same. 

Similarly, equations (10) and (1 1) can be rearranged as 

fit(m) = max-of-2 { fit+i(m°)+ Y°t(m). fit4-i(m')+ lC\(m) } (18) 
Pt(m) = fit(m) - Bt - (Bi + B2 + ... + Bm) (19) 

The stages Sim and S3m are stUl the same, but S2m and S4m are slightly different For 
example, for a computation of second embodiment, the four stages are 

Slm2(t): Compute Y°t.i(m") and Y^.i(m^) using Table 1 

S2m2(t): Compute at(m) using equation (16) 

S3m2(t): Compute At using one of the equations (3a), (3b), (3c) 

S4m2(t): Compute at(m) using equation (17), and store (Ai + A2 + ... + At) for future vise (in 
the next symbol sequence t+1) 

Compared with tiie prior art, the length of time taken for these modified stages is ahnost the 
same. SI and SI m2 take exactly the same computation time, and so do the pair S2 and S2m 
and the pair S3 and S3m. S4m2 is only two times longer than S4 because instead of subtracting 
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one constant, S4m2 subtracts two constants (the second one is the stored value from the 
previous S4in2 symbol). 



In general, the length of time taken to complete the stages given L number of symbols is 



Tm = Tim + T2m + T3m + L x T4m (20) 



By comparing with the prior art, we can substitute Tl = Tim, T2 = T2m, T3 = T3m, and T4 = 
T4m/2, and obtain 

T = Tim + L (T2m + T3m + T4m/2) (21) 
The number of times of improvement in speed compared with the prior art is 



T/Tm = (Tlm + L(T2m + T3m + T4m/2))/(Tlm + T2m + T3m + LxT4m) (22) 



For there to be an improvement in speed, the ratio T / Tm must be more than one. Further 
simplification of T / Tm > 1 yields 

L > 1+ (T2m/2) / (T2m + T3m - T4m/2) (23) 



It can be seen from eqxxation (23) that there will be an improvement in speed as long as the 
frame size, L, is not one, as the ratio (T2m/2) / (T2m + T3m ~ T4m/2) is likely to be less than 
one. Practically, this means that there will be an improvement at any frame size. 
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Figvire 3 illustrates an example of the timeline of the improved pipelining implementation 
according to the second embodiment of the present invention. In Fig. 3, it is assumed that the 
length of time taken for the stages Sim - S4m follows the proportion of Tlm:T2m:T3m:T4m 
= 1:1:1.5:1. Using the above proportion in. equation (22), the nxunber of times of improvement 
in speed approaches 3 for large L. 

However, the improvement shoxild be larger in the case where S3 is computed using a more 
complex method such as in equations (3 a) and (3b), because a relatively large T3 produces a 
large T / Tm ratio. 

This algorithm may still significantly faster than the prior art. 

The modification of ct/p computation in the max-log-MAP algorithm as explained in the first 
and second embodiments can also be applied to a BCJR (fuil-MAP) algorithm, or a log-MAP 
algorithm. Furthermore, the algorithm may be applied to the calculation of a or p or both. 

In siunmary, a method for calculating decoding signals in a MAP algorithm, such as max-log- 
MAP algorithm, h£is been disclosed. The above-described embodiihents of the invention are 
intended to be illustrative only. Numerous altemative embodiments may be devised by those 
skilled in the art without departing fi:om the scope of the following claims. 



